MOVING PICTURES ENCODING METHOD AND APPARATUS 
BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention is related to a moving pictures 
5 encoding method, and an apparatus that uses same, which are 
applied to digital picture systems and applications, and 
picture databases utilized in a variety of fields, such as 
communications, broadcasting, data storage, and computers, 
and more particularly to a moving pictures encoding method, 
10 and an apparatus that uses same, which are advantageous 
when encoding picture data containing a scene change* 

2 . Description of the Related Art 

^ lb> j §> ^ V In general, the amount of information comprising 

moving pictures itself is huge. For this rga^n, when 

15 encoding moving pictures, redundajxeyaJ-ong a spatial axis 

is removed by using an o^feh^gonal transform (discrete 

cosine transf ornj^^rocess and variable-length encoding. In 

addition ^^r^dundancy along a time axis is removed by 

find^rig the difference between the first and last frames, 

20 ^nd encoding the difference data. 

The constitution of a picture will be described here 

in order to understand the following explanation. 

An interlaced picture, for which every other line of a 

single frame is scanned, and which constitutes an even 

25 field constituting only even-numbered scanning lines, and 

an odd field constituting only odd-numbered scanning lines, 

is currently used in TV formats (NTSC: National Television 
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System Committee of the United States, PAL: Phase 
Alternation by Line of Europe). 

MPEG-2 (Moving Pictures Experts Group) is an encoding 
scheme, which also corresponds to the encoding of 
interlaced pictures, and performs encoding, which takes 
frame/field into consideration for picture structure, and 
inter-picture motion prediction. These will be explained 
hereinbelow. 

With MPEG-2, both frame allocation and field 
allocation are possible for a picture. When a frame is 
allocated as a picture, it is called a frame structure, and 
when a field is allocated, it is called a field structure. 
Encoding is performed by treating a picture as macro-block 
(MB) units. 

Fig. 26 shows a frame structure macro-block MB1, and a 
field structure odd field macro-block MB2, and even field 
macro-block MB3 . Each macro-block (MB) constitutes, for 
example, 16 x 16 pixels. 

With MPEG-2, there is also frame prediction and field 
prediction, which are well-suited to encoding an interlaced 
picture, and one motion vector is used for frame 
prediction, and two motion vectors are used for field 
prediction. 

Here, methods for the above-mentioned reduction of 
temporal redundancy can be broadly divided into 3 encoding 
methods in accordance with the scope of inter-picture 
prediction utilized. 



The first is an intra-picture encoding method, wherein 
encoding is performed within a picture, the second is an 
inter-picture sequential predictive encoding method, 
wherein encoding is performed by also using inter-picture 
forward prediction. In addition, the third is a bi- 
directional predictive encoding method, wherein encoding is 
performed using two-directional inter-picture prediction, 
which combines the forward direction and the backward 
direction. 

A picture by the above-mentioned first encoding method 
is called an I-picture ( Intra-Picture) , a picture by the 
second encoding method is called a P-picture (Predictive- 
Picture), and a picture by the third encoding method is 
called a B-picture (Bi-directionally predictive-Picture) . 

Further, from the standpoint of whether or not each 
type of picture can be used as an inter-picture predictive 
reference picture from another picture, an I-picture and a 
P-picture are referenced, but a B-picture is not 
referenced. Accordingly, an I-picture and a P-picture 
collectively are called a reference picture. 

Furthermore, because the degree of temporal 
correlation is great between each picture of moving 
pictures, more redundancy can be removed from a P-picture, 
which utilizes a correlation with a forward picture, than 
from an I-picture. Further, more redundancy can be removed 
from a B-picture, which utilizes a correlation with a 
backward picture, than from a P-picture. 




That is, viewed from the amount of data of pictures of 
the same picture quality, it is a relationship wherein an 
I-picture > P-picture > B-picture. Also, because a 
reference picture can at the least be used in motion 
5 prediction from another picture, and can constitute the 
original picture from which a predictive frame is 
generated, it is desirable that a reference picture be of 
the highest picture quality possible. 

In this sense as well, a reference picture > non- 
10 reference picture is desirable with regard to amount of 
data. 

However, efficiency cannot be said to be good if 
encoding is performed using only a P-picture or a B- 
picture, which have small amounts of data. This is because 
15 when an error occurs, the error is propagated temporally 
with the inter-picture encoding of a P-picture or B- 
picture. 

Consequently, since this causes problems, it is 
desirable to refresh periodically with an I-picture. 
20 Actually, when encoding using a B-picture, it is common to 
perform encoding by changing the picture type, as shown in 
Fig. 27. 

In Fig. 27, B indicates a B-picture, I indicates an I- 
picture, and P indicates a P-picture, and the same holds 
25 true for the other figures described hereinbelow. Further, 
an arrow of inter-picture prediction signifies that the 
predictive frame of the picture at the tip of an arrow is 
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generated using the picture at the base of the arrow. 

Incidentally, the size of the Group-of -Pictures (GOP) 
in the example of Fig. 2 7 is 12 pictures, that is, I- 
picture refresh is performed every 12 pictures. And the 
5 distance between each reference picture is 3 pictures. 

Furthermore, inter-picture prediction is performed in 
macro-block units, which divide a picture into a plurality 
of blocks as described above, and determines the difference 
between a pixel unit and a block, wherein a coded picture 
10 macro-block is the same size as a reference picture. 

And then, the cumulative sum thereof is treated as a 
prediction error, the block with the smallest prediction 
error is selected, and a predictive frame is generated. 
Next, difference data with an encoded macro-block is 
15 encoded. As the difference, the sum of absolute values of 
a simple pixel difference, the sum of squares of a pixel 
difference, and the like are utilized. 

Here, when an I-picture is put to multiple uses, it is 
undesirable because the same scene is coded at the same 
20 picture quality, thereby increasing the amount of coded 

data. When encoding is performed at a fixed rate, picture 
unit data allocation becomes that much smaller, and picture 
quality deteriorates . 

However, when scene changes and other inter-picture 
25 correlation is low, prediction efficiency deteriorates when 
inter-picture prediction is used, and in some cases, 
picture quality deteriorates even more than when intra- 
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picture encoding is performed. 

Therefore, as a measure against such trouble, there 
has been proposed a method, wherein, even when encoding is 
performed using an I-picture at a certain fixed interval, 
5 when a scene change SC is detected as shown in Fig. 28, the 
immediately succeeding picture is coded using I-picture 
intra-picture encoding. 

When all is said and done, this method is undesirable 
because an I-picture, or an intra-picture coded picture is 
10 generated each time a scene change occurs, increasing the 

43 volume of coded data by that much. 

\n 

ry Further, so as to prohibit to the utmost an increase 

in the frequency of an I-picture, as shown in Fig. 29, when 
a scene change occurs, the count value of the heretofore I- 

ii 

Q 15 picture fixed interval cycle is reset. Then, there is a 
I'* method, wherein GOP is once again reconfigured from the 

i;n 

Q picture at reset time, and encoding is performed using a 

fixed interval cycle I-picture (for example, GOP size = 
12). 

20 It. might also be possible to have a method, wherein, 

when a scene change is generated in a B-picture as shown in 
Fig. 30, intra-picture prediction is held not with the B- 
picture thereof, but by making the reference picture that 
comes immediately thereafter in the input sequence either 

25 an I-picture or intra-frame encoding, and using inter- 
picture prediction from preceding and succeeding reference 
pictures . 
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However, in the case of the example shown in Fig. 30, 
if it is assumed that the encoding process is achieved in 
real-time using hardware, to include input-output apparatus 
in particular, there is absolutely no way of predicting the 
5 cycle at which a reference picture will appear. Therefore, 
picture memory management becomes more difficult than when 
a reference picture appears at a fixed cycle. 

Further, when viewed from the aspect of data 
allocation, changing from a P-picture to an I-picture 
10 causes much less fluctuation of data allocation than 

q 

M changing from a B-picture to an I-picture, making it less 

ry likely that a stream buffer will exhibit an underflow or 

overflow state. As a result, the method of Fig. 3 0 is 
! »3 considered desirable. 

Q 15 However, even a control method like that shown in Fig 

I'f 30 has problems such as those hereinbelow. 

M Here, Fig. 31 is a diagram prepared for explaining the 

problem points in this Fig. 30. To simplify the 
explanation, nothing is shown other than a B-picture, which 
20 is a encoded picture, a forward reference picture, and a 
backward reference picture. 

In general, in the case of a frame structure, in which 
a frame is encoded as a picture, inter-frame prediction and 
inter-field prediction are performed as inter-picture 
25 prediction. 

More specifically, there are 3 forms of prediction in 
frame prediction: forward prediction, backward prediction, 
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and bi-directional prediction, in which prediction is 
performed using a picture that combines predictive pictures 
obtained by predictions in both directions thereof. 
Similarly, in inter-field prediction as well, there are 3 
5 forms of prediction: forward prediction, backward 

prediction, and bi-directional prediction, in which 
prediction is performed using a picture that combines 
predictive pictures obtained by predictions in both 
directions thereof . 
10 Of the reference numbers of the motion vectors shown 

O 

in Fig, 31, the numbers 5, 10 are frame vectors. Numbers 
^ 1,2, 3, 4, 6, 7, 8, 9 are field predictions. Furthermore, 

;< n in Fig. 31, the right field is the odd-numbered field, and 

is =b 

^3 the left field is shown as the even field. 

M 15 In frame prediction, when using forward prediction, 

Ll! 

i;f the vector that is selected is number 5, and when using 

Q backward prediction, the vector that is selected is the 

number 10 vector. When using bi-directional prediction, 

both the number 5 and 10 vectors are used. 
20 Conversely, in field prediction, a predictive frame is 

generated using a motion vector in each of the odd and even 

fields . 

In the case of forward prediction, the B-picture odd 
field generates a predictive frame using either the number 
25 1 or 2 motion vector, and the even field generates a 

predictive frame using either the number 3 or 4 motion 
vector. 
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Similarly, in the case of backward prediction, the odd 
field uses either the number 6 or 7 motion vector, and the 
even field uses either the number 8 or 9 motion vector. In 
the case of bi-directional prediction, the odd field 
5 generates a reference frame by combining a forward 

predictive frame, which is generated using either the 
number 1 or 2 motion vector, and a backward predictive 
frame, which is generated using either the number 6 or 7 
motion vector. 

10 The even field generates a reference frame by 

combining a forward predictive frame, which is generated 
using either the number 3 or 4 motion vector, and a 
backward predictive frame, which is generated using either 
the number 8 or 9 motion vector. 

15 The above is a method of motion prediction utilized in 

current MPEG-2 and other moving pictures encoding schemes . 
However, when a scene change occurs between the odd/even 
fields within an interlaced frame, the algorithms of 
current moving pictures encoding schemes only perform frame 

20 structure encoding, and in field prediction, one field is 
not able to perform forward prediction, while the other 
field performs backward prediction. 

Consequently, in a relevant picture, encoding 
efficiency drops excessively, and picture quality 

25 deterioration becomes conspicuous. Further, since 

prediction is not applied between pictures, if there is an 
increase of macro-blocks, for which intra-picture encoding 
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is performed using a B-picture, the significance of 
creating a reference picture of immediately thereafter 
using an I-picture or intra-picture encoding is also lost. 

SUMMARY OF THE INVENTION 
5 Therefore, an object of the present invention is to 

provide a moving pictures encoding method and apparatus, 
which are capable of achieving encoding efficiency and good 
picture quality encoding even when there exists a picture 
so that a scene change occurs between the odd/even fields 
10 within an interlaced frame* 

;- =^ 

! k j? Further, an object of the present invention is to 

provide a moving pictures encoding method and apparatus, 
| sfl which achieve good encoding efficiency and picture quality 

M in accordance with adding a prediction mode, and 

M 15 appropriately switching encoding schemes, and switching 
= s ™ input data. 

•'e 1 = 

Q As a first constitution of a moving pictures encoding 

method and apparatus of the present invention, which 
achieves the above-mentioned goals, is a method and 
20 apparatus, which performs moving pictures encoding having a 
picture, which treats a frame of input signals of an 
interlaced image as a picture, and which predicts and 
encodes this picture in macro-block units from both a 
forward picture, which is temporally in the past, and a 
25 backward picture, which is in the future, this method and 
apparatus being characterized in that, when encoding is 
performed by predictions from the above-mentioned bi- 
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directional pictures, there is performed motion prediction 
between fields, which treat as a unit a field, which 
divides each picture in macro-block units into either odd 
numbers or even numbers of pixel scanning lines of this 
5 frame, and as a predictive mode, the use of prediction from 
either the forward field motion vector for one field, or 
the backward field motion vector for the other field is 
selected and switched in picture units, a predictive 
picture corresponding to this selected predictive mode is 

10 generated, and the above-mentioned input signals are 
encoded using the generated predictive picture. 

As a second constitution, a moving pictures encoding 
method and apparatus are characterized in that, when 
encoding is performed by a prediction from the above- 

15 mentioned bi-directional picture, there is performed motion 
prediction between fields, which treat as a unit a field, 
which divides each picture in macro-block units into either 
odd numbers or even numbers of pixel scanning lines of this 
frame, and as a predictive mode, predictions from the 

20 forward field motion vector for one field, and the backward 
field motion vector for the other field are constantly 
used, a predictive picture corresponding to the above- 
mentioned predictive mode is generated, and input signals 
are encoded using the generated predictive picture. 

25 Furthermore, the moving pictures encoding method and 

apparatus according to the first constitution is 
characterized in that a scene change of the above-mentioned 
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input signal picture is detected in a field unit divided 
into either odd numbers or even numbers of pixel scanning 
lines of the above-mentioned frame, and relative to a 
picture, in which a scene change exists between fields 
within the above-mentioned frame in the above-mentioned 
input signal picture, a prediction from a forward field 
motion vector for one field, and a backward field motion 
vector for the other field are treated as a selection item. 

Further, as a fourth characteristic, a moving pictures 
encoding method and apparatus, which performs encoding by 
treating an interlaced image as input signals, and treating 
a macro-block as a unit, this moving pictures encoding 
method and apparatus being characterized in that a scene 
change on the above-mentioned input signal picture is 
detected in a field unit divided into either odd numbers or 
even numbers of pixel scanning lines of a frame, and the 
fields of the above-mentioned input signal picture are 
encoded as pictures for a picture, for which inter-picture 
prediction is performed at the least from both a forward 
picture that is temporally in the past, and a backward 
picture that is in the future, and for a picture, in which 
a scene change exists between intra-frame fields in the 
above-mentioned input signal picture. 

As a fifth characteristic, a moving pictures encoding 
method and apparatus, which performs encoding by treating 
an interlaced image as input signals, and treating a macro- 
block as a unit, this moving pictures encoding method and 
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apparatus being characterized in that a scene change on the 
above-mentioned input signal picture is detected in a field 
unit divided into either odd numbers or even numbers of 
pixel scanning lines of a frame, and for a picture, for 
5 which inter-picture prediction is performed from both a 
forward picture that is at least temporally in the past, 
and a backward picture that is in the future, and either 
the above-mentioned forward picture, or backward picture is 
encoded as the same data for a picture, in which a scene 

10 change exists between intra-frame fields in the above- 
mentioned input signal picture. 

Furthermore, as a sixth characteristic, the moving 
pictures encoding method and apparatus according to the 
fifth characteristic, are characterized in that encoding, 

15 which treats the above-mentioned picture as the same data 
as either a forward reference picture or a backward 
reference picture, is performed by fixing the encoding 
prediction direction to one direction frame prediction, 
making the motion vector always equal 0, and also making 

20 the encoding difference data always equal 0. 

And furthermore, as a seventh characteristic, the 
moving pictures encoding method and apparatus according to 
the fifth characteristic, are characterized in that 
encoding, which treats the above-mentioned picture as the 

25 same data as either a forward reference picture or a 

backward reference picture, sets either a forward reference 
picture or a backward reference picture as the current 
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frame picture when setting the coded original picture. 

Further, as an eighth characteristic, a moving 
pictures encoding method and apparatus, which inputs an 
interlaced picture as an input signal, this method and 
apparatus being characterized in that a scene change on the 
above-mentioned input signal picture is detected in a field 
unit divided into either odd numbers or even numbers of 
pixel scanning lines of a frame of this picture, and one 
field is repeatedly outputted relative to a picture, in 
which a scene change exists between intra-frame fields in 
the above-mentioned input signal picture. 

As a ninth characteristic, a moving pictures encoding 
method and apparatus, which performs encoding by treating a 
frame of input signals of an interlaced image as a picture, 
and by treating a macro-block as a unit, this moving 
pictures encoding method and apparatus being characterized 
in that a scene change on the above-mentioned input signal 
picture is detected in a field unit divided into either odd 
numbers or even numbers of pixel scanning lines of a frame, 
and one field of a picture is repeatedly outputted when 
this picture is such that prediction is performed from both 
a forward picture that is temporally in the past, and a 
backward picture that is in the future, and a scene change 
exists between intra-frame fields in these input signals. 

As a tenth characteristic, a moving pictures encoding 
method and apparatus according to any of the first through 
the third characteristics, are characterized in that there 
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is performed motion compensation, which corresponds to 
encoded moving pictures encoding, and which corresponds to 
prediction from a forward field motion vector for one 
field, and a backward field motion vector for the other 
field. 

Further characteristics of the present invention will 
become clear from the hereinbelow description in accordance 
with the figures of the aspects of the embodiments of the 
invention, 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram illustrating a first solution 
principle of the present invention; 

Fig. 2 is a diagram illustrating a second solution 
principle of the present invention; 

Fig. 3 is a diagram illustrating a third solution 
principle of the present invention; 

Fig. 4 is a diagram illustrating a fourth solution 
principle of the present invention; 

Fig. 5 is a block diagram of an example of a common 
constitution of an encoding apparatus to which a solution 
principle of the present invention is applied; 

Fig. 6 is a block diagram showing an example of a 
constitution of the motion searching device 5 in the 
constitution of Fig. 5; 

Fig. 7 is a block diagram of an example of a 
constitution of the sums of absolute values of differences 
arithmetic circuit 51-5m of Fig. 6; 



15 



Fig. 8 is a block diagram of an example of a motion 
prediction decision device 6; 

Fig. 9 is a block diagram of an example of a 
constitution of a motion prediction decision device 6, 
which, as a first embodiment of the present invention, 
realizes the first solution principle of the present 
invention; 

Fig. 10 is a block diagram of an example of a 
constitution of a motion prediction decision device 6 
according to a second embodiment of the present invention; 

Figs. 11A and 11B are diagrams illustrating the 
physical relationship of an odd field and an even field; 

Fig. 12 is a diagram illustrating a third embodiment 
of the present invention; 

Fig. 13 is a diagram illustrating a fourth embodiment 
of the present invention; 

Fig. 14 is a block diagram of an example of a 
constitution of a scene change detecting circuit 65 in the 
embodiment of Fig. 13; 

Fig. 15 is a diagram illustrating a detection 
algorithm of the scene change detecting circuit 65 of Fig. 
13; 

Fig. 16 is a diagram showing the relationship of a 
first field Fl and a second field F2 in the algorithm of 
Fig. 15; 

Fig. 17 is a diagram illustrating a fifth embodiment 
of the present invention; 
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in 



Fig. 18 is a diagram illustrating a sixth embodiment 
of the present invention; 

Fig. 19 is an example of a constitution of the frame 
memory readout circuit 67 of Fig. 18; 
5 Fig. 20 is a diagram illustrating a seventh embodiment 

of the present invention; 

Fig. 21 is a diagram illustrating an eighth embodiment 
of the present invention; 

Fig. 22 is an example of a constitution of the pre- 
10 processing circuit 68 of Fig. 21; 

Fig. 23 is a block diagram of an example of a 
constitution of a decoding apparatus corresponding to an 
encoding apparatus based on the solution principles 
illustrated by Fig. 1-Fig. 4; 
15 Fig. 24 is a diagram illustrating the significance of 

top#f ield#f irst#f lag in field prediction of the present 
invention; 

Fig. 25 is a block diagram of an example of a 
constitution of the bi-directional motion compensating 
20 circuit 105 of Fig. 23; 

Fig. 26 is a diagram illustrating a picture; 

Fig. 27 is a diagram illustrating the changing of a 
picture type when encoding using a B-picture; 

Fig. 28 is a diagram illustrating a method, wherein 
25 the immediately succeeding picture of when a scene change 
SC is detected is encoded using an I-picture or other 
intra-picture encoding scheme; 
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Fig. 29 is a diagram illustrating a method, wherein 
the count value of an I-picture fixed interval cycle is 
reset when a scene change occurs so as to prohibit I- 
picture frequency from increasing as much as possible; 

Fig. 3 0 is a diagram illustrating a method, wherein, 
when a scene change occurs in a picture, the reference 
picture that comes immediately thereafter in the input 
sequence uses either an I-picture or intra-frame encoding; 
and 

Fig. 31 is a diagram for describing the problem points 
in Fig. 29. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Aspects of the embodiments of the present invention 
will be described hereinbelow in accordance with the 
figures. Furthermore, the same or similar parts in the 
figures will be described by appending the same reference 
numeral or reference symbol. 

Fig. 1 is a diagram illustrating a first solution 
principle of the present invention. In the figure, P, B, 
and I, respectively, indicate a P-picture, B-picture, and 
I-picture, as was explained in Fig. 26 through Fig. 31 
above. 

The characteristic of the solution principle of Fig. 1 
lies in the fact that a prediction mode is added at the 
time of a B-picture during frame structure encoding. 
Enabling the carrying out of prediction, wherein one field 
is forward prediction, and the other field is backward 
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prediction, makes it possible to perform prediction 
efficiently . 

In Fig, 1, as is clear from comparison with Fig, 31, 
the B-picture of the odd field performs forward prediction, 
and the B-picture of the even field performs backward 
prediction. That is, prediction is possible when the odd 
field uses either the 1 or 2 motion vector, and when the 
even field uses either the 8 or 9 motion vector. 

Furthermore, in encoding, when a motion prediction 
mode is added, a decodable decoding scheme must be prepared 
even when the prediction mode added thereto is included. 

Next, a second solution principle according to the 
present invention is a method, which does not add a 
prediction mode. Fig. 2 is a diagram illustrating the 
second solution principle of the present invention. The 
second solution principle is a method, wherein, in a B- 
picture, in which a scene change is detected at the least 
between fields within a relevant frame, field structure 
encoding is performed as two field pictures for this frame. 

That is, as shown in Fig. 2, when a scene change is 
detected between the odd/even fields of a B-picture, the 
odd/even fields of the B-picture thereof undergo encoding 
processing as two field pictures. 

So doing enables inter-picture prediction, wherein one 
field of the two fields is forward prediction, and the 
other field is backward prediction, making it possible to 
perform efficient prediction. 
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Furthermore, as a third solution principle of the 
present invention, there is a case, in which a prediction 
mode is not added, and a picture structure is not changed. 
In this case, because it is impossible to perform efficient 
inter-picture prediction, a coded picture is pasted via 
inter-picture prediction to either a forward or a backward 
reference frame, or input signals are manipulated, and 
encoding is performed as a frame that differs from the 
actual original picture. 

This enables encoding which does not reduce encoding 
efficiency. For example, as shown in Fig. 3, in a picture 
(in Fig. 3, picture B), in which a scene change occurs at 
the least between intra-frame fields, a coded frame is 
encoded so that the picture thereof constitutes the same 
data as that of either a forward reference picture or a 
backward reference picture. 

In the case of Fig. 3, encoding is performed so that 
the picture constitutes the same data as a forward 
reference picture. That is, as shown in Table 1, in the 
encoding of input of picture B, encoding is performed so 
that picture B constitutes the same data as picture A. 
However, especially when a plurality of sequential B- 
pictures exist, it is desirable to presume that [picture B] 
is the same data as the closest reference picture. 
Table 1: Third Solution Principle of the Present Invention 



Input 


Picture A 


Picture B 


Picture C 


Encoded Data 


Picture A 


Picture A 


Picture C 
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Further, viewed over time, when another B-picture 
exists between a B-picture to be coded and a reference 
picture, it is desirable to also encode the pertinent B- 
picture as the same data as the reference picture. This is 
to maintain the continuity of a picture when an encoded 
stream is reproduced. 

As a method for encoding a picture as a frame that 
differs from the actual encoded picture, there is yet 
another method according to the present invention. This is 
a method, wherein, when a scene change is detected between 
intra-frame fields as shown in Fig. 4, one of the fields of 
the picture thereof is repeated for the pertinent frame. 

In the scene following repetition in accordance 
therewith, a scene change can be made in frame units. 
Encoding efficiency increases as a result thereof. This 
method of repeating one field can be carried out by looking 
only at input data that is not particularly related to 
encoding. 

Next, embodiments for realizing the above-mentioned 
solution principles of the present invention will be 
described. But prior to the descriptions thereof, to make 
it easier to understand the present invention, an example 
of a common constitution of an encoding apparatus, to which 
a solution principle of the present invention is applied, 
will be described. 

Fig. 5 shows a block diagram of an example of a common 
constitution of an encoding apparatus to which a solution 
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principle of the present invention is applied. The 
sequence of an I-picture, P-picture, and B-picture is 
controlled in accordance with an encoding controller 1, and 
in correspondence therewith , switches SW1, SW2 switch to 
5 either the intra-picture encoding (intra) side, or the 
inter-picture encoding (inter) side. 

An encoder 2 is constituted from an orthogonal 
transform circuit and a quantizing circuit. In accordance 
therewith, spatial compression, and data structure-based 
10 data quantity reduction are possible. An MPEG-coded data 
stream is outputted from the encoder 2 . 

Conversely, the output of the encoder 2 is inputted to 
a decoder 3, and a decoding process corresponding to the 
encoding process of the encoder 2 is performed. Then, the 
15 decoded picture data is stored in frame memory 4, and 
utilized in motion search and motion compensation 
processing . 

An example of a constitution of a motion searching 
device 5 is shown in Fig. 6, and has m-number of sums of 

20 absolute values of differences arithmetic circuits 51-5m, 
into which respective data from m-number of macro-blocks, 
which constitute the reference picture search range shown 
in Fig. 27, are inputted, and a comparator 50 for inputting 
the output of m-number of sums of absolute values of 

25 differences arithmetic circuits 51-5m, and selecting and 
outputting a minimum value. 

Here, the above-mentioned m-number of macro-blocks are 
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produced by shifting horizontally, and vertically one pixel 
at a time in a reference picture search range, having, for 
example, 16 x 16 pixels as one macro-block unit. 

Each pixel of an original picture macro-block is 
inputted in common to the respective input terminals of the 
sums of absolute values of differences arithmetic circuits 
51-5m. 

The sums of absolute values of differences arithmetic 
circuits 51-5m of Fig. 6 have a constitution that is common 
to all, and a block diagram of an example of the 
constitution thereof is shown in Fig. 7. The constitution 
shown in Fig. 7 has absolute value arithmetic circuits 521- 
52N, and difference circuits 511-51N of a number 
corresponding to the number of pixels N constituting a 
single macro-block. 

Furthermore, the constitution has a cumulative adder 
501, which cumulatively adds the outputs of the absolute 
value arithmetic circuits 521-52N. Here, when one macro- 
block constitutes 16 x 16 pixels as described above, N is 
246. 

The comparator 50 in Fig. 6 is inputted with outputs 
from the sums of absolute values of differences arithmetic 
circuits 51-5m, which correspond to the plurality m-number 
of macro-blocks, and selects the output of the sums of 
absolute values of differences arithmetic circuits 51-5m 
with the minimum value. 

That is, in the reference picture search range, the 
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macro-block with the strongest correlation to the original 
picture macro-block is selected, and a motion vector 
(quantity of movement) and sum of absolute values of 
differences are outputted as motion search results. 
5 Furthermore, a motion prediction decision device 6, in 

outputting the minimum value of the sums of absolute values 
of differences between a reference picture and an original 
picture as described above, the minimum value of the sums 
of absolute values of differences in the forward direction, 

10 that is, with a past reference picture, the minimum value 
of the sums of absolute values of differences in the 
backward direction, that is, with a future reference 
picture, and the minimum value of the sums of absolute 
values of differences with reference pictures in both 

15 directions are outputted. 

Furthermore, for sums of absolute values of 
differences with a reference picture, a frame sum of 
absolute values of differences is outputted in the case of 
frame prediction, and a field prediction sum of absolute 

20 values of differences is outputted in the case of field 
prediction. 

Here, the sum of absolute values of differences of a 
frame prediction is the 16 x 16 pixel sum of absolute 
values of differences minimum value in the above-mentioned 
25 example, and sum of absolute values of differences of a 

field prediction is the sum of an odd 16 x 8 pixel scanning 
line sum of absolute values of differences minimum value 
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and an even 16 x 8 pixel scanning line sum of absolute 
values of differences. 

These absolute values of differences are inputted to 
the motion prediction decision device 6- Fig. 8 is a block 
diagram of an example of a constitution of a motion 
prediction decision device 6. There are prediction 
comparators 60, 61, and 62 corresponding to forward 
prediction, backward prediction, and bi-directional 
prediction. Each prediction comparator compares a frame 
prediction sum of absolute values of differences against a 
field prediction sum of absolute values of differences, and 
outputs the smallest thereof. 

Meanwhile, the switching selection of either P-picture 
encoding or B-picture encoding is controlled in accordance 
with control by the encoding controller 1 . At P-picture 
encoding, either the frame prediction sum of absolute 
values of differences, or the field prediction sum of 
absolute values of differences selected by prediction 
comparator 60 is outputted as the prediction result. 

Further, at B-picture encoding, the outputs of 
prediction comparators 60, 61, and 62 are inputted into 
prediction comparator 63, the minimum value thereof is 
selected, and outputted as the output of the motion 
prediction decision device 6. 

The prediction decision types and motion vectors in a 
motion prediction decision device 6 are as shown in Table 2 
hereinbelow. 
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Table 2: Relation Between Prediction Type and Number of 

Vectors 





Forward 
Prediction 


Backward 
Prediction 


Bi-directional 
Prediction 


Predic- 
tion 
Type 


Frame 


Field 


Frame 


Field 


Frame 


Field 


MV 

Count 


1 


2 


1 


2 


2 


4 


MV 

Count/ 
Direc- 
tion 


1 


2 


1 


2 


1 


2 



The output of the motion prediction decision device 6 
is switched at inter-picture encoding to the difference 
circuit 7 by switch SWl in accordance with encoding 
controller 1. Therefore, the difference between the input 
picture data and the prediction result of the motion 
prediction decision device 6 is outputted by the difference 
circuit 7, and inputted to the encoder 2. 

Next, a constitution, which realizes a solution 
principle of the present invention described in Fig. 1 
through Fig. 4 of the present invention will be described 
relative to the common encoding apparatus of the above- 
mentioned Fig. 5 through Fig. 8. 

Fig. 9 is a block diagram of an example of a 
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constitution of a motion prediction decision device 6, 
which, as a first embodiment of the present invention, 
realizes the above-mentioned first solution principle of 
the present invention. The present invention is 
5 characterized by the fact that it provides yet another 

prediction comparator 64 relative to the constitution of 
the motion prediction decision device 6 of Fig* 8. 

Frame structure encoding realized by the embodiment 
shown in Fig. 9, as the first solution principle of the 
,.pt 10 present invention illustrated in Fig. 1, adds a predictive 
; E S mode, which, in B-picture field prediction, performs 

prediction from the forward direction for one field, and 
j*^ prediction from the backward direction for the other field. 

^ To add this predictive mode, there is a possibility 

| ! 1 15 that an additional bit will be required in a macro-block 
!!s unit from the standpoint of the syntax (data string 

convention). Therefore, this predictive mode is 
advantageous in that the constitution enables ON/OFF in 
picture units . 

20 As a method for same, there is selected motion 

prediction, which performs prediction from the forward 
direction for one field, and from the backward direction 
for the other field, just like the frame prediction and 
field prediction performed by the motion prediction of 

25 heretofore, and which also computes the prediction error of 
1 direction 1 vector field prediction, and shows the 
minimum prediction error via prediction comparator 63. 
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Furthermore, the switch SW in Fig. 9 is provided for 
switching in accordance with whether or not a prediction 
mode, wherein prediction is performed from the forward 
direction for one field, and from the backward direction 
for the other field, is to be applied in picture field 
prediction, which uses the bi-directional prediction of the 
present invention . 

Here, the prediction types and number of vectors in 
the embodiment of Fig. 9 are as shown in Table 3 
hereinbelow. 



Table 3 : Relation Between Prediction Type and Number of 

Vectors 





Forward 
Prediction 


Backward 
Prediction 


Bi-directional 
Prediction 


Predic 

tion 

Type 


Frame 


Field 


Frame 


Field 


Frame 


Field 


Field2 


MV 

Count 


1 


2 


1 


2 


2 


4 


2 


MV 

Count/ 
Direc- 
tion 


1 


2 


1 


2 


1 


2 


1 



In Table 3, field prediction on the bi-directional 
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side is conventional, the same as in Table 2. Field 2 
prediction on the bi-directional side is added in 
accordance with the present invention, and there is 1 
motion vector per direction. Field 2 prediction therefore 
has 2 motion vectors for forward/backward prediction. 

Fig. 10 is a block diagram of an example of a 
constitution of a motion prediction decision device 6 
according to yet a second embodiment of the present 
invention. The constitution is such that the output of 
prediction comparator 64, which is added relative to the 
embodiment of Fig. 9, is constantly inputted to prediction 
comparator 63. 

Therefore, the constitution is such that, for a B- 
picture, prediction, which is always from a forward field 
prediction result for one field, and from a backward field 
prediction result for the other field, is entered into an 
item. 

As described above, according to the present 
invention, a prediction mode, wherein there is 1 motion 
vector per direction in field prediction, is added as a new 
prediction mode. 

Table 4 is a code table for an MPEG-2 frame structure 
prediction mode (Table 6-17 Meaning of f rame#motion#type : 
ISO/IEC 13818-2 (MPEG-2 Video) definition of 
f rame#motion#type) . 
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Table 4 



Code 


Spatial 
Temporal 
Weight 
class 


Prediction 
Type 


Motion 
vector 
count 


Mv format 


dmv 


00 




Reserved 








01 


0,1 


Field- 
based 


2 


Field 


0 


01 


2,3 


Field- 
based 


1 


Field 


0 


10 


0,1,2,3 


Frame- 
based 


1 


Frame 


0 


11 


0,2,3 


Dual-Prime 


1 


Field 


1 



Here, to realize the addition of a prediction mode 
without increasing code bits at all, as shown in Table 5 
hereinbelow, this prediction mode can be allocated to code 
'00', which is allocated to 'reserved' of Prediction type 
of the above Table 4. 



Table 5 



00 


0,1 


Field- 


1 


Field 


0 






based 









Here, in the mode to be added, it is possible to have 
a case in which the odd field (top# field) is forward 
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M 
i y 

i.3 



prediction, and the even field (bottom#f ield) is backward 
prediction, and, contrary thereto, a case in which the even 
field (bottom# field) is forward prediction, and the odd 
field (top# field) is backward prediction, 
5 To increase motion prediction efficiency when a scene 

change occurs between intra-frame fields, the field that 
exists first in terms of input time can perform forward 
prediction, and the field that exists thereafter in terms 
of input time can perform backward prediction. 
» 10 With MPEG-2, because an interlaced picture constitutes 

1 frame, a 1-bit flag called top#f ield#f irst#f lag, which 
exists in Picture encoding extension, can be referenced to 
determine which field exists first temporally. 

That is, when the top#f ield#f irst#f lag = 1, Fig. 11A 
15 pertains, and the odd field can perform forward prediction. 
By contrast, when top#f ield#f irst#f lag = 0, Fig. 11B 
pertains, and the even field can perform forward 
prediction . 

In other words, since the field that is to perform 
20 forward prediction using the newly added field prediction 
is uniformly decided by the top#f ield#f irst#f lag, which 
exists in Picture encoding extension, there is no need to 
provide a new flag. 

Further, a motion vector is also a field prediction, 
25 and since the vector count is 1 vector for one field, a 
motion vector can be sent just like in frame prediction. 
In the above-described second embodiment, since no 
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additional bits are required from the standpoint of the 
MPEG-2 syntax, it is desirable for this prediction to be ON 
at all times. 

Fig. 12 is a diagram illustrating a third embodiment 
of the present invention. The constitution of Fig. 12 is 
such that, relative to the constitution of Fig. 9, the 
switching of the switch SW is controlled by a field scene 
change detecting circuit 65. 

Now, in relation to the above description of the 
second embodiment, the '00' of the code column in the above 
Table 3 becomes a 'reserved' bit in order to make the MPEG 
start code, which commonly holds a 0 in bit 2 3 and a 1 in 
bit 1, a unique word. 

Therefore, when it is impossible to allocate a new 
motion prediction type to '00', either the bits of a 
f rame#motion#type must be increased 1 bit, or a new, at the 
minimum 1-bit mode identification bit must be provided. 

In this case, since motion prediction is performed in 
macro-block units, for example, even if an additional bit 
is 1 bit, when conversion is carried out per picture, it 
constitutes a huge increase in bit quantity. Furthermore, 
it is very possible that efficiency is raised in accordance 
with the addition of this bit only in a picture in which a 
scene change occurs between intra-frame fields. 

Therefore, in the third embodiment, when an ON/OFF 
flag of an additional motion prediction mode in the first 
embodiment is provided, a scene change occurs between 
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intra-f rame fields, and the picture thereof is a B-picture, 
the flag thereof is set to ON. 

As shown in Fig. 12, a scene changing circuit 65 is 
provided as a circuit that generates a field scene change 
detection flag. In accordance therewith, in a B-picture in 
which a field scene change occurs, there is added a 
prediction mode such that, using field prediction, 
prediction is performed from the forward direction for one 
field, and prediction is performed from the backward 
direction for the other field. 

In accordance with so doing, the additional bits of 
the code quantity in accordance with a mode addition in a 
B-picture in which a field scene change does not occur is 
at the most 1 bit, making it possible to hold the increase 
of the code quantity in check. 

Fig. 13 is a diagram illustrating a fourth embodiment 
of the present invention. Relative to the constitution of 
Fig. 5, there is provided a switching circuit 66, which 
performs the switching of the frame structure and field 
structure on the encoding input data side. The switching 
circuit 66 performs switching of the frame structure and 
field structure when the scene change detecting circuit 65 
detects a case in which a scene change occurs between 
intra-f rame fields, and the picture thereof is a B-picture. 

When field structure encoding is performed, it is 
possible, for example, to perform forward prediction for 
one field picture, and backward prediction for another 
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field picture, and prediction efficiency does not drop 
significantly . 

Fig. 14 is a block diagram of an example of a 
constitution of the scene change detecting circuit 65 in 
the embodiment of Fig. 13. In the figure, the pixel number 
is the order in which a field is raster scanned, and N field 
signifies the number of pixels in a field. 

The scene change detecting circuit 65 is constituted 
having difference circuits 651-65N, which determine the 
difference between each pixel of the current field and 
preceding field in field units, absolute value circuits 
661-66N, which determine the absolute values of the outputs 
of the difference circuits, a cumulative adder 650, which 
cumulatively adds the outputs of the absolute value 
circuits 661-66N, and a threshold decision circuit 660, 
which compares and determines the output of the cumulative 
adder 650 against a threshold TH. 

That is, when the cumulative addition value of the 
absolute difference of 1 field exceeds the threshold TH, a 
discrimination decision output is generated from the 
threshold decision circuit 660. 

Fig. 15 is a diagram illustrating a detection 
algorithm of the scene change detecting circuit 65, which 
detects a field scene change on the basis of a 
discrimination decision output from this threshold decision 
circuit 660. Fig. 16 is a diagram showing the relationship 
of a first field Fl and a second field F2 in the algorithm 



34 



of Fig. 15. 

The algorithm of Fig. 15 will be described with 
reference to this Fig. 16. The presence or absence of a 
scene change is detected in the first field Fl, which is 
5 determined by the value of the top#f ield#f irst#f lag (Step 
SI). When a scene change is not detected in the first 
field Fl, the presence or absence of a scene change is 
detected in the second field F2 (Step S2 ) . 

At this time, when Step S2 determines that a scene 
, 8a . 10 change exists, the scene change is between the first field 
?i Fl and the second field F2 , as shown in Fig. 16 (Step S3). 

; s 5 Therefore, in this case, the scene change is determined to 

|; x s be a field scene change (FIC) . 

" Sa? Further, when Step S2 determines that a scene change 

,' B ^ 15 does not exist, a determination is made that there is no 
;!!! scene change (Step S5). 

!= 5 When there is a scene change in Step SI, next the 

presence or absence of a scene change is determined for the 
second field (Step S4 ) . When there is no scene change in 

20 the second field, a determination is made that there is no 
scene change (Step S5). When there is a scene change in 
the second field, since this constitutes the fact that a 
scene change extends into the first field as shown in Fig. 
16, the scene change is determined to be a frame scene 

25 change FRC in this case (Step S6). 

Fig. 17 is a diagram illustrating a fifth embodiment 
of the present invention. This embodiment is characterized 
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in that the motion prediction mode is limited. 

When a scene change occurs between intra-frame fields, 
and the relevant picture is a B-picture as was described 
above with regard to the third solution principle, this 
5 embodiment realizes the encoding of an encoded frame as the 
same data as either a preceding or succeeding reference 
frame . 

In the embodiment of Fig. 17, the constitution is such 
that the direction of motion vector prediction is fixed in 
I= _ 10 one direction, frame prediction is fixed, the motion vector 
'}£ is fixed at vector 0, and encoding control is carried out 

j tj so that even encoding coefficient data is not sent. 

In accordance with so doing, an encoded picture can be^ 

' iB? encoded as the same data as the locally-decod^d^^ame^f^a 

: := : = 15 reference picture in accordance with^tfie solution principle 
illustrated in Fig. 3. Theiaofion vector can also be fixed 
4 after all motion modp^detection has been performed, and the 

constitution c^n also be designed so that no motion 
predict iop^mode operation is performed once this mode has 
20 been pntered. 

According to such a constitution, it is possible to 
prevent encoding efficiency from becoming extremely poor. 
In this case, because encoding is performed by treating an 
encoded picture the same as a reference picture that either 
25 comes before or after a scene change, actually a completely 
different picture is encoded for one field either before a 
scene change or after a scene change. 
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Further, with regard to the reference picture that is 
treated as the same data as a coded picture, the use of a 
reference picture that is closer, input time-wise, to a 
picture in which a scene change occurs is believed to be 
better, but it need not be limited to this. 

Further, when a separate B-picture exists between a 
scene change-picture and a reference picture, also treating 
the pertinent B-picture as the same data as a reference 
picture is desirable from the visual and encoding 
efficiency standpoints . 

Fig. 18 is a diagram illustrating a sixth embodiment 
of the present invention. This embodiment is characterized 
by the fact that encoding input data is switched. In the 
constitution of Fig. 18, a frame memory readout circuit 67 
is provided on the data input side, and as a method for 
treating a coded picture as the same data as a reference 
picture as in the above-mentioned constitution of Fig. 17, 
when setting encoding data from an inputted frame, the 
reference picture to be treated as the same data is set. 

Fig. 19 is an example of a constitution of the frame 
memory readout circuit 67 of Fig. 18. Furthermore, Fig. 19 
shows a readout circuit for field 1 input, and since the 
same constitution is used for field 2, the constitution 
therefor is omitted from the figure. 

The constitution of Fig. 19 has a switch, which is 
switched in accordance with a scene change detection signal 
from a scene change detecting circuit 65, and when there is 
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a B-picture, and a field scene change, the same data as 
that of either a preceding reference picture or a 
succeeding reference picture is outputted in accordance 
with the switch SW. A counter 670 generates the timing for 
switching I, P, and B-pictures . 

With MPEG-2, since a motion search is generally 
performed here using original pictures, an original of a 
reference picture can be stored in memory. Further, a 
post-motion compensation prediction picture is achieved 
using a locally-decoded picture resulting from the one-time 
encoding/decoding of a reference picture. Consequently, a 
decoded reference picture can also be stored in memory. 

Therefore, the reference picture set as an original 
picture here can be set as either an original 
picture/locally-decoded picture. Further, since the coded 
frame is changed, the limits of the encoding method shown 
in the embodiment of Fig. 17 are not particularly 
necessary. 

Here, since the same data as that of a reference 
picture is used in encoding in the embodiments of Fig. 17, 
Fig. 18, practically no data is generated in the pertinent 
picture. Consequently, there is also an effect, whereby 
data can be allocated to another picture within the GOP. 

Fig. 20 is a diagram illustrating a seventh embodiment 
of the present invention- With this embodiment, the 
present invention can be applied without necessarily 
linking up to an encoding apparatus. That is, this 
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embodiment can be used as an input picture switching 
apparatus capable of being used in common applications. 

Data of field 1 and field 2, which constitute 1 frame, 
are inputted, and a scene change detecting circuit 65 
immediately detects a scene change between intra-frame 
fields when there is a scene change therebetween. 

When the scene change detecting circuit 65 detects the 
occurrence of a field scene change, a switching signal is 
generated, a switch SW is switched to the terminal 1 side, 
and the same data as that of field 1 is outputted for field 
2 . 

In this manner, when a field scene change is detected, 
a field is repeated in the pertinent frame, and the input 
frame is remade. By repeating one field, it is possible to 
artificially change the location of a scene change between 
intra-frame fields to an inter-frame scene change in 
accordance with the solution principle illustrated in Fig. 
4. 

Fig. 21 is a diagram illustrating an eighth embodiment 
of the present invention. This embodiment is a 
constitution, which applies the embodiment of Fig. 2 0 to an 
encoding apparatus. The embodiment of Fig. 2 0 is used as 
the pre-processing circuit 68 of Fig. 21. 

Fig. 22 is an example of a constitution of the pre- 
processing circuit 68 in Fig. 21. This embodiment is 
constituted such that, when a scene change detecting 
circuit 65 detects a field scene change, field 1 data is 



39 



repeated and outputted for field 2 in accordance with a 
switch SW. Furthermore, the function of a counter 670 is 
the same as that described in the embodiment of Fig. 19. 

In actual encoding in accordance with this embodiment, 
processing is performed at a scene change between intra- 
frame fields in a B-picture for which encoding efficiency 
is poor. If a scene change is an inter-frame scene change, 
predictive encoding can be efficiently performed for a B- 
picture from preceding and succeeding reference pictures by 
simply using an I-picture or intra-picture encoding on a 
reference picture immediately following, input time-wise, a 
scene change. 

The thinking behind the eighth embodiment is similar 
to that of the embodiment of Fig. 18 in the sense that a 
coded picture is encoded as a different picture from the 
actual inputted picture, but in the embodiment of Fig. 21, 
when a scene change occurs between fields, a frame is not 
replaced with a reference picture, but rather both fields 
within the frame are made to have the same data by 
repeating one field in an inputted original picture as 
illustrated in Fig. 4. 

Fig. 23 is a block diagram of an example of a 
constitution of a decoding apparatus corresponding to an 
encoding apparatus based on the solution principles 
illustrated in accordance with Fig. 1-Fig. 4 above. 

A signal, wherein the output of a decoder 100 is added 
to a motion compensation signal either directly or via an 
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adder 101 by switching a switch SW, is outputted as a 
decoded signal. 

To generate a motion compensation signal, this 
decoding apparatus has a fore-picture memory 102 and an 
aft-picture memory 103, and also has a forward motion 
compensating circuit 104 , a bi-directional motion 
compensating circuit 105, and a backward motion 
compensating circuit 106, 

In Fig, 23, the solid line arrows for data read out of 
memory 102, 103 indicate frame prediction, and the broken 
line arrows indicate field prediction. And the types 
thereof can be switched in each macro-block. 

Here, the macro-block (MB) data in an MPEG stream 
outputted from an encoding apparatus comprises MB location, 
prediction type, which distinguishes between forward, 
backward and bi-directional prediction, motion vectors, and 
encoding coefficient data. 

In the current MPEG-2 standard, the number of motion 
vectors at frame structure encoding, as shown in Table 1, 
is 1 vector for one direction in the case of frame 
prediction, and 2 vectors for one direction in the case of 
field prediction. 

The data on the stream in this case is as follows . 

In the case of frame prediction of one (forward or 
backward) direction prediction: 

prediction type + motion vector 1 

In the case of field prediction of one (forward or 
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# 



backward) direction prediction: 

prediction type + motion vector 1 + motion vector 2 
The above-mentioned motion vector 1 is a top# field 

(odd field) vector, and motion vector 2 is a bottom#field 
5 (even field) vector. 

In the case of frame prediction of bi-directional 

prediction: 

prediction type + motion vector 1 + motion vector 2 
Motion vector 1 is a forward* frame vector, and motion 
10 vector 2 is a backward* frame vector. 

Furthermore, in the case of field prediction of bi- 
directional prediction: 

prediction type + motion vector 1 + motion vector 2 + 
motion vector 3 + motion vector 4 
15 Motion vector 1 is a f orward#top#f ield vector, motion 

vector 2 is a f orward#bottom#f ield vector, motion vector 3 
is a backward#top#f ield vector, and motion vector 4 is a 
backward#bottom# field vector. 

For bi-directional field prediction according to the 
20 present invention, motion vectors are one direction 1 
vector, and constitute 

prediction type + motion vector 1 + motion vector 2 
Motion vector 1 is a forward#f ield vector, and motion 
vector 2 is a backward* field vector. 
25 This can be taken in 2 ways, that is, motion vector 1 

is a forward#top#f ield vector, and motion vector 2 is a 
backward#bottom# field vector, or motion vector 1 is a 
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forward#bottom# field vector, and motion vector 2 is a 
backward#top#f ield vector, and which way is taken is 
determined by a top#f ield#f irst#f lag . 

Fig. 24 is a diagram illustrating the same concept as 
that of Fig. 11. Fig. 24 is a diagram showing the modes of 
motion vector 1, and motion vector 2, which are determined 
by the top#f ield#f irst#f lag . That is, when 
top#f ield#f irst#f lag = 1 , as shown in Fig. 24A, motion 
vector 1 is a forward#top# field vector, and motion vector 2 
is a backward#bottom# field vector. 

Conversely, when top#f ield#f irst#f lag =0, as shown in 
Fig. 24B, motion vector 1 is a forward#bottom# field vector, 
and motion vector 2 is a backward#top# field vector. 

Fig. 25 is a block diagram of an example of a 
constitution of the bi-directional motion compensating 
circuit 105 of Fig. 23. 

In the case of frame prediction, a macro-block of a 
frame constitution of a location equivalent to each motion 
vector is read out from fore-picture memory 102, and aft- 
picture memory 103, and h the sum of the addition thereof 
is calculated, and outputted in accordance with an adding 
circuit 251, and a h circuit 252. 

In the case of field prediction, 4 h macro-blocks, 
which are equivalent to the locations of 4 motion vectors, 
are read out from memory 102, 103. These are a fore- 
picture odd field h macro-block, a fore-picture even field 
h macro-block, an aft-picture odd field h macro-block, and 
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an aft-picture even field h macro-block. 

Next, the odd field h macro-block of the fore-picture, 
and the odd field h macro-block of the aft-picture are 
inputted, and h the sum of the addition thereof is 
calculated, and outputted in accordance with an adding 
circuit 253, and a h circuit 254, 

Further, the fore-picture even field h macro-block, 
and the aft-picture even field h macro-block are inputted, 
and h [the sum] of the addition thereof is calculated, and 
outputted in accordance with an adding circuit 255, and a h 
circuit 256. 

Furthermore, in the case of field prediction according 
to the present invention, a determination is made on the 
basis of the above-mentioned top#f ield#f irst#f lag whether a 
combination of a fore-picture odd field h macro-block and 
an aft-picture even field h macro-block, or a combination 
of a fore-picture even field h macro-block and an aft- 
picture odd field h macro-block, which is equivalent to 2 
motion vectors, is read out from memory 102, 103. The 
decided combination is outputted as a macro-block from a 
selection circuit 257. 

As described above, according to the present 
invention, even when scene change detection occurs during 
frame structure encoding in a B-picture, and in a place 
that is not a break in a frame picture, efficient encoding 
can be performed without a deterioration of prediction 
efficiency . 
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In each of the above embodiments, operation was 
described using the constitution of the hardware block, but 
the present invention is not limited to hardware, and a 
corresponding function can also be achieved using software. 

Furthermore, examples of each of the above embodiments 
were provided for an understanding of the present 
invention, but the scope of the protection of the present 
invention is not limited thereto. So long as same does not 
depart from the idea of the present invention, a moving 
pictures encoding method and apparatus that is equivalent 
to a constitution in the claims is also included in the 
scope of protection of the present invention. 
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